Building Data Integration Systems: A Mass Collaboration Approach

نویسندگان

  • AnHai Doan
  • Robert McCann
چکیده

Building data integration systems today is largely done by hand, in a very labor intensive and error prone process. In this paper, we describe a conceptually new solution to this problem: that of mass collaboration. The basic idea is to think about a data integration system as having a finite set of parameters whose values must be set. To build such a system, the system administrators can construct and deploy a system “shell”, then ask the users to help the system “automatically converge” to the correct parameter values. This way, the enourmous burden of system developments is lifted from the administrators and spread “thinly” over a multitude of users. We discuss the challenges to this approach and propose solutions. We then describe our current effort in applying this approach to the problem of schema matching in the context of data integration.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building Data Integration Systems via Mass Collaboration

Building data integration systems today is largely done by hand, in a very labor-intensive and error-prone process. In this paper we describe a conceptually new solution to this problem: that of mass collaboration. The basic idea is to think about a data integration system as having a finite set of parameters whose values must be set. To build such a system the system administrators construct a...

متن کامل

An Agent Approach to Data Sharing in Virtual Worlds and CAD

This paper describes an agent approach to sharing and synchronising building model data among CAD and virtual world systems. Virtual worlds facilitate a level of communication and collaboration not readily available in conventional CAD systems. The integration of virtual worlds and CAD systems using a common data model can make a significant impact on synchronous collaboration and real time mul...

متن کامل

Automatic Building Information Model Query Generation

Energy efficient building design and construction calls for extensive collaboration between different subfields of the Architecture, Engineering and Construction (AEC) community. Performing building design and construction engineering raises challenges on data integration and software interoperability. Using Building Information Modeling (BIM) data hub to host and integrate building models is a...

متن کامل

Critical Success Factors for Data Virtualization: A Literature Review

Data Virtualization (DV) has become an important method to store and handle data cost-efficiently. However, it is unclear what kind of data and when data should be virtualized or not. We applied a design science approach in the first stage to get a state of the art of DV regarding data integration and to present a concept matrix. We extend the knowledge base with a systematic literature review ...

متن کامل

Design of Oil Refineries Hydrogen Network Using Process Integration Principles

This paper describes the application of process integration principles to the design of oil refineries hydrogen network. In this regard, a design hierarchy as well as heuristics and required guidelines are proposed. The recommended rules compensate lack of procedure to the design and make the design process easier. The guiding principles of the design are based upon pinch technology and ext...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003